Surprise sampling: Improving and extending the local case-control sampling
نویسندگان
چکیده
Fithian and Hastie (2014) proposed a new sampling scheme called local case-control (LCC) that achieves stability efficiency by utilizing clever adjustment pertained to the logistic model. It is particularly useful for classification with large imbalanced data. This paper proposes more general based on working principle data points deserve higher probability if they contain information or appear "surprising" in sense of, example, error of pilot prediction absolute score. Compared relevant existing schemes, as reported Ai, et al. (2018), one has several advantages. adaptively gives out optimal forms variety objectives, including LCC Ai (2018)'s special cases. Under same model specifications, estimator also performs no worse than those literature. The estimation procedure valid even misspecified and/or inconsistent dependent full We present theoretical justifications claimed advantages optimality design. Different from our sample theory are population-wise rather data-wise. Moreover, approach can be applied unsupervised learning studies, since it essentially only requires specific loss function response-covariate structure needed. Numerical studies carried evidence support shown.
منابع مشابه
Improving Sampling-based Motion Control
We address several limitations of the sampling-based motion control method of Liu et at. [LYvdP∗10]. The key insight is to learn from the past control reconstruction trials through sample distribution adaptation. Coupled with a sliding window scheme for better performance and an averaging method for noise reduction, the improved algorithm can efficiently construct open-loop controls for long an...
متن کاملImproving the Dwivedi Sampling Scheme
Despite recent advances in Monte Carlo rendering techniques, dense, high-albedo participating media such as wax or skin still remain a difficult problem. In such media, random walks tend to become very long, but may still lead to a large contribution to the image. The Dwivedi sampling scheme, which is based on zero variance random walks, biases the sampling probability distributions to exit the...
متن کاملImproving Local Features by Dithering-Based Image Sampling
The recent trend of structure-guided feature detectors, as opposed to blob and corner detectors, has led to a family of methods that exploit image edges to accurately capture local shape. Among them, the WαSH detector combines binary edge sampling with gradient strength and computational geometry representations towards distinctive and repeatable local features. In this work, we provide alterna...
متن کاملSampling strategies in nested case-control studies.
A stratified version of nested case-control sampling which we call "countermatching" is presented. This design uses data available for all cohort members to obtain a sample for collecting additional information in a case-control substudy. Hitherto the only stratified sampling design for such studies has involved matching of controls to cases with respect to confounding variables. However, in so...
متن کاملLocal Unimodal Sampling
One of the simplest ways of optimizing a problem without knowledge of its gradient, is to iteratively and randomly pick a sequence of candidate solutions that improve on the fitness. This paper shows mathematically that to exhaust the optimum of a problem, the search-range from which the candidate solutions are picked, must be decreased during an optimization run. A simple technique for doing t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronic Journal of Statistics
سال: 2021
ISSN: ['1935-7524']
DOI: https://doi.org/10.1214/21-ejs1844